Local and Global Distinguishability in Quantum Interferometry 
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A statistical distinguishability based on relative entropy characterises the fitness of quantum states for phase 
estimation. This criterion is employed in the context of a Mach-Zehnder interferometer and used to interpolate 
between two regimes, of local and global phase distinguishability. The scaling of distinguishability in these 
regimes with photon number is explored for various quantum states. It emerges that local distinguishability is 
dependent on a discrepancy between quantum and classical rotational energy. Our analysis demonstrates that 
the Heisenberg limit is the true upper limit for local phase sensitivity. Only the 'NOON' states share this bound, 
but other states exhibit a better trade-off when comparing local and global phase regimes. 

PACS numbers: 42.50.St,42.50.Dv,03.65.Ud,06.20.Dk 



Interferometry may be viewed as estimation of a finite 
phase parameter from a position of prior ignorance (U. It 
is also used to identify small changes in a known phase, or 
to track such changes over time. The tasks of global phase 
acquisition and local phase tracking are both important chal- 
lenges. Classically, the distinction is well-understood, for ex- 
ample in implementations of Radar/Sonar J2]. Any compre- 
hensive analysis of a quantum interferometer should address 
these two different facets of metrology J^. 

Real interferometers always have trade-offs between per- 
formance (e.g. accuracy and precision), robustness (to photon 
loss and decoherence), and complexity (resources in state gen- 
eration, phase encoding and measurement). Focusing on the 
performance aspect of an ideal interferometer free of losses 
and decoherence may clarify which quantum correlations lead 
to precision enhancement 0,(51] in local and global limits. In 
this paper we examine the intrinsic fitness of various quantum 
states for interferometry independent of any specific estima- 
tion protocol, i.e. irrespective of how the measurement data 
are processed. The fitness criteria introduced will be based on 
the collective information content of the measurement distri- 
bution, and not simply on features like the mean or variance. 

A quantum state of n photons distributed across two spa- 
tial modes a and b is isomorphic to a spin-j particle. A 
two-mode Fock state \n a ) a ® \rib}b is mapped onto \ j,m) z 
where j = (n a + n&)/2 and subscript z denotes that this 
is an eigenstate of J z with eigenvalue m = (n a — rib)/2. 
In terms of creation and annihilation operators, generators of 
unitary transformations representing linear optical elements 
are: J x = ^(a'b + b'a), J y = — — b'a) and J z = 
i(ata— StS). Also, J 2 = h/2(h/2+l) where h= (tfa+ttb) 
is the total photon number. The generators obey commutation 
relations, [J a , Jp] = 2ie a p 1 J 1 and the eigen-equations are 
J 2 \j,m)i = j(j + l)\j,m)i and J l \j,m) l = m\j,m) l for 
% G {x,y, z}. 

Given an input or probe state \ipo), the Mach- 
Zehnder (MZ) interferometer with phase difference 9 
between the arms (FIG0 performs a transformation 
exp{i J x irj2} exp{i J z 9} cxp{— i J x ir/2}, equivalent to a 
rotation (5] through 8 about the y-axis, i.e. cxp{i,J y 9}. 



One infers an unknown 9 by making measurements on the 
output state \rj)g). This task is hampered by the non-existence 
of a Hermitian phase operator 9 in Hilbert space. Even in 
this idealised lossless setting, any estimate </> e of the phase 
has accuracy and precision limited by both the choice of 
|V'o) an d the measurement employed. Conventionally and 
most simply one counts photons in the output arms of the 
interferometer, equivalent to measurement of J z , and this will 
be considered here. Other measurements have been proposed, 
ejj. J 2 01, parity measurements 0], homodyne detection 
Hi, heterodyne detection HI and forms of generalised 
measurement II 3L 111. 



12]. 



The evolution of a pure state in an ideal MZ interferometer 
is governed by the Schrodinger equation, 
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(1) 



for 9 a time-like variable and h = 1, The spin observable 
J y plays the role of Hamiltonian and as such is a conserved 
quantity, its eigenvectors are preserved. The transformed state 

\tpe) is 
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ip m (6)\j,m} z (2) 



in the J z basis. The probability amplitudes ipm(9) are 
r m {9) exp{i(j> m (9)} in polar form, the modulus r m and ar- 
gument cf> m both real-valued functions of 9. The measure- 
ment distribution is the set P(9) = {p m (9)}, where p m (9) = 
\ z (j,m\^ 8 )\ 2 = ^ m {6)i> m {6) = r 2 m (9). These probabili- 
ties lie at the heart of phase estimation - over a data run the 
frequencies of measurement outcomes tend towards the same 
distribution. The full distribution {p m (9)} carries information 
about 9, more than is characterized by its maxima [ 13], or that 
is contained in means and variances, or particular moments of 
J z . This is an important detail when the governing probabil- 
ities p m (9) are multiple-peaked and non-gaussian in 9, as is 
often the case in quantum interferometry 111 411 . 

By matching ratios of observed measurements to a particu- 
lar governing distribution one may infer 9 to within some pre- 
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FIG. 1: Mach-Zehnder interferometer with 2-mode input state \ipo). 
Angle 6 is the relative phase between the two arms. Beamsplitters 
perform ±-| rotations about the x axis. Detectors at the outputs count 
photon numbers n a and tij. 



cision. However, two issues prevent this from being done ex- 
actly and unambiguously. Firstly, the number of experimental 
trials is finite, and may be small, so the set of measured out- 
comes may not be typical of the governing distribution. For 
example, a fair coin tossed four times may still land 'heads' 
four times in a row, an atypical result. Secondly, even for 
typical data sets there may not exist a one-to-one mapping 
«-> P{9) between phases and probability distributions for 
9 £ [0, 2tt). Without such a mapping there is no simple inver- 
sion, P{9) h-> 9. 

What properties of p m (9) might make it more immune to 
these difficulties? Ideally, one asks that p m (0i) 7^ p m (^) and 
that these distributions are somehow 'far apart', in the sense 
that one would be unlikely to mistake one for the other. From 
the Theory of Types fisll . the probability P(2-»i) that a parent 
distribution P (62) gives rise to a data set typical of P(9i) is 
bounded by: 



2-kS[P(9 1 )\\P(9 2 )] 
(k + 1)(2J + 1) 



< 



P{2- 



»i) < 2- kSlP{9l)llP{f>2 ' >] (3) 



for a sequence of k independent measurements. Here 
(2j + 1) is the cardinality of distinct measurement outcomes 
m G {— j, —j + 1, . . . , The non-negative functional 

S[P{6\ ) I |P(#2 )] is known as the relative entropy or Kullback- 
Leibler divergence 



+0 



S[P(0l)||P(0 2 )] = J2 PrM^g 



Pm(9i) 
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quantifying the distinguishability of one distribution from an- 
other. If better- than-even odds were required in 'distinguish- 
ing' distribution P2 from its neighbor Pi after one measure- 
ment, Eq.© imposes a lower bound •S'JPi | IP2] > 1. Rela- 
tive entropy has been employed before J9[] in the context of a 
'maximum likelihood' approach to phase estimation, but now 
we use it to find the intrinsic fitness of states for this task, 
without reference to a particular estimation protocol. 

We propose that a global distinguishability for estimating a 
phase 9, previously known to exist within a finite interval A 
centred on 9 = x be defined as: 

1 /-X+A/2 

£>( Xj a)=— / stPteoiiP^dMfc. (5) 

a Je 1 ,e 2 =x-A/2 

This is the arithmetic mean of relative entropies between all 
pairs of distributions originating within this phase interval. In 



terms of the upper bounds to probabilities in Eq.©, 2~~ kv 
is a geometric mean. Note also that while 5(Pi||P 2 ) 7^ 
S(P2\\Pi), quantity T> is symmetric; it contains the sum 
5(Pi||P 2 ) + 5(P 2 ||Pi) for all probability pairs. 

Parameter A may be considered an a priori precision, and 
\ an a priori estimate; the mean phase across the A interval 
for a uniform prior probability distribution. The size of A has 
a pronounced effect on the properties of T>. Since 9 is a cyclic 
variable, p m (9) = p m (9 -\-2tt) and A < 2tt is a necessary pre- 
requisite to performing the inversion {p m (9)} 1— » 9. In addi- 
tion to translation symmetries there may exist mirror symme- 
tries (depending on the input state) and distinguishability will 
be lower for A intervals inclusive of these symmetry points. 
A strength of our analysis is the freedom to operate beyond a 
restricted neig hborhoodlike 9 w 0, cf. Refs.^HMllJ- 

One may confine 9 to an interval A -C 2ir, given adequate 
prior phase knowledge. In this case it is quite possible for a 
phase uncertainty to be greater after the measurement than be- 
fore. This is because the direct measurement of J z is unsharp 
- such a discretely-valued measurement can only impart par- 
tial information about the continuous phase parameter 9. For 
vanishingly small A ~ 0, then T> approaches a local distin- 
guishability: 



S[P( X -A/2) I \P( x +A/2)] + S[P(x+A/2)\\P(x-A/2)} 
' A 2 
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{ J(x-A/2) + J(x + A/2) } + 0(A J ) 



(6) 



Above, J{9) is the classical Fisher Information |18'] for the 
measurement distribution P(9), 



J{0) 



+3 

m=—j 



\np m (9) 



(7) 



The lower line of Eq.© comes from a series expansion of 
5[P(x±A/2)||P(xtA/2)] about (x± A/2) to second order 
in A. The Fisher information gives a measure of the informa- 
tion contained in the full distribution P{9) about the parameter 
9 Il5ll . It is also the unique distance metric on the manifold of 
probability distributions d. With the knowledge that J{9) 
provides the scaling factor of local distinguishability, we now 
derive an explicit form in terms of the Hamiltonian and the 
measurements. Taking the definition, Eq.© and substituting 
Pm (0)=r 2 m (9), 



+j 



J{9)=A £ f 2 m (0), 



(8) 



m=-j 



where derivatives with respect to 9 are denoted by an overdot. 
If instead one substitutes p m {9) = ^^ n (9)ip m (9), 



+j 



m = E + Hhtf» + 2 ^>™ ■ (9) 



tp-n 



m=—3 

The last term may be evaluated using Eq.(Q]), J2 m ^m^Pm = 

(ipe\ipe) = -i(ipe\Jl x iJy\^e) = (Jy)- The other terms 
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under the summation of Eq.© can be simplified by putting 

ipm = r m cxp{i<^ m }, 



+3 

J{9) = J2 2 ^n ~ IrUl. + %J 2 y 
m=—j 

Comparing Eq.dTol) and Eq.© gives 



(10) 



+3 



J(9)/4 = (J 2 ) J2 
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= (Ji) 



$ 2 



(id 



The final term with subscript c denotes a classical mean with 
respect to distribution (p m (S)}, for the square of a random 
variable 4> taking values <j> m . In analogy with Ref. l20ll the 
Fisher information for any \ij)g) is exactly the discrepancy be- 
tween quantum rotational energy and the rotational energy as- 
sociated with 2j + 1 classical point objects, one for each m 
value. Note that Eq.dTTb holds in general for any hermitian 
Hamiltonian J y t—> H and for spin measurements m made 
along any direction in Euclidean space, z i— > z' . 

From Eq.dTTb we may develop a type of uncertainty relation 
connecting 9, the Hamiltonian and the measurement basis as 
follows. An explicit lower bound on the mean-squared error 
of an unbiased [21] estimate (f> e on the true phase 9 is given 
by the reciprocal of the Fisher information: 



,) 2 > 1/JW) , 



(12) 



called the Cramer-Rao bound [18]. Therefore, for optimal 
9 precision one maximizes the Fisher information 12211 . The 
bound is well-known in information theory, having a general 
applicability not shared by a popular linearised error model, 



d(J z )/d9 



[(J 2 ) - (J,) 2 ] 1 / 2 , fl II E 0| which 
produces incorrect or inconclusive scaling of 8<j) e for certain 
states \ipg) Il4ll7ll . The Cramer-Rao bound provides a proof 
of the 'Heisenberg' precision limit ifJl 0] as follows: For cer- 
tain input states <fr m = and (J 2 ) is bounded from above by 
its largest eigenvalue, i.e. j 2 . Therefore for a state of 2j = n 
photons in a MZ interferometer employing photon counting 
measurements, the optimal scaling of precision with photon 
number is 5<fr e = 1/n. This Heisenberg limit is uniquely 
achieved by an input 



l^o) 



\.h+J)y + 



-3)y 



(13) 



which takes the form of a 'NOON' state 12311 after the first 
beam-splitter. Traversing the phase element 9 and second 
beamsplitter, the state emerges with distribution: 

m , m (1+ (-l) j+m cos{2j(9 + tt/2) - CI) 

Pm{9) = * — ry— — r. (14) 

4 J (j — rn)\{j + my. 

Calculation of Eq.(|7]i gives J(9) = 4j 2 = n 2 , independent 
of 9. This is an exact result, without additional assumptions 
or approximations. So in the context of local distinguishabil- 
ity this NOON state is indeed optimal, and it has the smallest 



lower bound on mean-squared phase error. A caveat is that 
this result is non-constructive: no estimation technique is pro- 
posed that might reach the bound 12411 . And despite this lo- 
cal optimality, the NOON distribution has a periodicity n/j 
or 2ir/n, causing the distinguishability to saturate quickly for 
A > n/j. Thus, NOON states are inappropriate given poor 
prior knowledge about the phase. But in terms of phase sta- 
bility, they are the most sensitive to changes, e.g. in tracking a 
moving target phase after its initial acquisition, provided the 
target is moving slowly enough. 

Let us examine other probe states proposed for interferom- 
etry. For \ip ) i— > \ j, m) z , one can use 2( J 2 ) = ( J 2 ) — (J 2 ), 
and (j) m = to show in this case J(9) = 2[j(j + 1) — m 2 ], in- 
dependent of angle l25ll . Within this family of states the m = 
state l26ll gives the best local distinguishability and preci- 
sion limit via the Cramer-Rao bound, 5<j) e > [2j(j + l)] -1 / 2 ; 
lower than the bound found in 1 12i] for the same state and so- 
called optimal phase measurements. The state is one of equal 
photon numbers injected at both input modes to the interfer- 
ometer, \n/2) a ®\n/2)b, representing the n photon component 
of the two-mode squeezed state: |A) oc J2T=o ^ n \ v )a ® \v)b- 
In contrast, the input state \ j, +j) z returns the 'standard' limit, 
S<f> e > 1/v^ f° r tne Cramer-Rao bound. This is identi- 
cal to the result for n independent experiments carried out 
with a single photon each. It is also the limit for any single 
mode state (e.g. coherent light) combined with the vacuum 
at the MZ input ports, \tp) a <X> |0)t. This is expected because 
\n) a <£> \0)b !— * \ j, +j)z is the n-photon (or spin-j) component 
of such an input state. 



Consider the family of phase states 111 ll 12711 



\j, 7) 



= (15) 



parameterized by real phase 7. The MZ transformation is 
e %ejy \j, 7) = \j, 7 + 0). For local distinguishability, Eq.JTTI) 
gives J(9) = + 1), i.e. phase states have the same 

close-to-optimal scaling as the \j,0) z input, but with a 2/3 
pre-factor. However, FIG|2] illustrates that phase states have 
much better global distinguishability. 

Classical relative entropy is a very useful tool in classify- 
ing the fitness of quantum states for phase estimation in both 
local and global contexts. By interpolating between these two 
regimes via the a priori phase precision A, it has become ap- 
parent that a probe state |^o) displaying strong local distin- 
guishability characteristics may not be optimal in the context 
of global phase estimation, and vice versa. Despite this seem- 
ing trade-off between local and global properties, some input 
states have been shown to be 'jack of all trades', in particular 
the phase states (FIGl2J>. For each probe state the range of A 
leading to optimal distinguishability, e.g. A < ir/j for NOON 
states, may be identified as a new sub-wavelength scale pa- 
rameter. 

The proportionality of local distinguishability to Fisher in- 
formation has been emphasized, and we derived an explicit 
form in terms of a quantum-classical kinetic energy discrep- 
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n=10 n = 20 71=50 5 



-16 -14 -12 -10 

lnX>(x, A = l(r 3 ) 

FIG. 2: Comparing distinguishability ©(x, A) in relatively local 
(A = 10~ 3 ) and global (A = tt) regimes for various inputs \tpo). 
Each scatter point corresponds to a photon number 2j — n G [5, 50], 
shaded according to x G { f - , %- I 7r }- Pentagons depict NOON states 
(£ = 0), triangles |j, 0) z , squares |j, +j) z , and stars denote phase 
states of Eq.lflTt with 7 = it/2. NOON states exhibit the best local 
distinguishability, yet in the A = n domain their T> is small and does 
not improve with photon number. In contrast, \ j, +j) z (correspond- 
ing to only one illuminated input port) has strong global characteris- 
tics but poor local distinguishability. The phase states (stars) seem to 
offer the best compromise performance. Notice that the choice of \ 
has a significant effect on distinguishability for finite A. 

ancy. Using this result a proof of the Heisenberg precision 
bound was given, involving no limiting assumptions of small 
phase or large photon number. 

For phase estimation protocols with specific global and lo- 
cal distinguishability requirements, the framework we have 
presented is a flexible and powerful tool in finding an optimal 
trade-off between sensitivity and photon resources. As our ap- 
proach is based on the properties of measurement probabilities 
it is easily extended to other phase detection technologies be- 
yond the prototypical case we consider. Future work should 
extend the current analysis to a realistic setting incorporat- 
ing effects of dephasing, thermal noise and, most importantly, 
photon losses. 
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